Morpheme Segmentation in School-aged Children
نویسندگان
چکیده
One of the major questions in the cognitive science of language is how children learn the productive systems that encompass the grammar of our native languages. This paper addresses how children learn morphology, the systematic relationship between form and meaning. For example, the English plural can be made in one of many ways, most commonly with the addition of the suffix /-s/. While such patterns between sound and meaning are subject to exceptions, marking meanings through regular forms is one of many ways in which language is a productive, combinatorial system. There are at least two possible ways in which morphology could be learned. One possibility is that the learner acquires relationships between forms and their meanings by grouping words that share a common component of meaning and then determining what part of the form is used to signal this meaning. For example, to acquire the English plural, the learner would take known plural words in the lexicon (dogs, cats, bugs, buses) and extract the commonality that the majority of these words end in the letter ‘s’ to form the rule that ‘s’ marks plural. The other possibility is that the learner works in the opposite direction, observing that a number of words in the lexicon have commonalities of form and inferring a morphological or semantic relationship among words that sound similar. Discovering the regularities among forms, the learner can also determine the system that relates form to meaning. For example, the words dogs, cats, bugs, buses all end in ‘s’, and are therefore likely to be related in meaning. These routes to discovering morphological regularities have different advantages and disadvantages. Using semantic relations to discover regularities in form requires knowing the meanings of the words, so the learner would have to master several sets of word meanings before acquiring their morphological
منابع مشابه
Grammatical morphology in school-age children with and without language impairment: a discriminant function analysis.
PURPOSE The purpose of this study was to test Bedore and Leonard's (1998) proposal that a verb morpheme composite may hold promise as a clinical marker for specific language impairment (SLI) in English speakers and serve as an accurate basis for the classification of children with and without SLI beyond the preschool level. METHOD The language transcripts of 50 school-age children with SLI (M...
متن کاملCross-lingual Word Segmentation and Morpheme Segmentation as Sequence Labelling
This paper presents our segmentation system developed for the MLP 2017 shared tasks on cross-lingual word segmentation and morpheme segmentation. We model both word and morpheme segmentation as character-level sequence labelling tasks. The prevalent bidirectional recurrent neural network with conditional random fields as the output interface is adapted as the baseline system, which is further i...
متن کاملExtracting Morphemes without #
In unsupervised morphology learning algorithms (Goldsmith, 2000) (Cavar et al, 2004), it is assumed that words are already segmented by white space. However, it is somewhat obvious that white space between words are not pronounced in spoken English. While word boundaries are indicated by white spaces in written English, speakers of English do not pronounce anything particular for space or they ...
متن کاملMorfessor and Hutmegs: Unsupervised Morpheme Segmentation for Highly-inflecting and Compounding Languages
In this work, we announce the Morfessor 1.0 software package, which is a program that takes as input a corpus of raw text and produces a segmentation of the word forms observed in the text. The segmentation obtained often resembles a linguistic morpheme segmentation. In addition, we briefly describe the Hutmegs package, also publicly available for research purposes. Hutmegs contains semi-automa...
متن کاملUnsupervised Bilingual Morpheme Segmentation and Alignment with Context-rich Hidden Semi-Markov Models
This paper describes an unsupervised dynamic graphical model for morphological segmentation and bilingual morpheme alignment for statistical machine translation. The model extends Hidden Semi-Markov chain models by using factored output nodes and special structures for its conditional probability distributions. It relies on morpho-syntactic and lexical source-side information (part-of-speech, m...
متن کاملLinguistic Problems In Multilingual Morphological Decomposition
An algorithm for the morphological decomposition of words into morphemes is presented. The application area is information retrieval, and the purpose is to find morphologically related terms to a given search term. First, the parsing framework is presented, then several linguistic decisions are discussed: morpheme selection and segmentation, morpheme classes, morpheme grammar, allomorph handlin...
متن کامل